Extremal problems for ordered (hyper) graphs: 
applications of Davenport-Schinzel sequences 



Martin Klazar* 



Abstract 

We introduce a containment relation of hypergraphs which respects 
linear orderings of vertices and investigate associated extremal func- 
tions. We extend, by means of a more generally applicable theorem, 
the n log n upper bound on the ordered graph extremal function of F = 
({1,3}, {1,5}, {2, 3}, {2, 4}) due to Fiiredi to the n(logn) 2 (loglogn) 3 
upper bound in the hypergraph case. We use Davenport-Schinzel se- 
quences to derive almost linear upper bounds in terms of the inverse 
Ackermann function a(n). We obtain such upper bounds for the ex- 
tremal functions of forests consisting of stars whose all centers precede 
all leaves. 



1 Introduction and motivation 

In this article we shall investigate extremal problems on graphs and hyper- 
graphs of the following type. Let G = ([n], E) be a simple graph that has 
the vertex set [n] = {1,2, ... ,n} and contains no six vertices 1 < V\ < v 2 < 
• • • < v 6 < n such that {vi, v 3}, {v±, v 5}, {v 2, V4}, and {v 2, vq} are edges of G, 
that is, G has no ordered subgraph of the form 



G^= /^T^W . (1) 
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Determine the maximum possible number g(n) = \E\ of edges in G. 

What makes this task hard is the linear ordering of V = [n] and the fact 
that Go must not appear in G only as an ordered subgraph. If we ignore 
the ordering for a while, then the problem asks to determine the maximum 
number of edges in a simple graph G with n vertices and no 27^2 subgraph, 
and is easily solved. Clearly, if G has two vertices of degrees > 3 and > 5, 
respectively, or if G has > 6 vertices of degrees 4 each, then a 2K 12 subgraph 
must appear. Thus if G has a vertex of degree > 5 and no 2Ki t2 subgraph, 
it has at most (2(n — 1) + n — l)/2 = 3n/2 — 1.5 edges. If all degrees are 
< 4, the number of edges is at most (3(n — 5) + 4 • 5)/2 = 3n/2 + 2.5. On 
the other hand, the graph on [n] in which deg(n) = n — 1 and [n — 1] induces 
a matching with [(n — 1)/2J edges has n + [(n — 1)/2J — 1 edges and no 
2Ki 2 subgraph. We conclude that in the unordered version of the problem 
the maximum number of edges equals 3n/2 + 0(1). 

The ordered version is considerably more difficult. Later in this section 
we prove that the maximum number of edges g(n) satisfies 

n ■ a(n) < g(n) < n ■ 2 ( - 1+ ° W)a{n)2 (2) 

where a(n) is the inverse Ackermann function. Recall that a(n) = min{m : 
A(m) > n} where A(n) = F n (n), the Ackermann function, is defined as 
follows. We start with Fi(n) = 2n and for i > 1 we define F i+ i(n) = 
Fi(Fi(. . . Fi(l) . . .)) with n iterations of F,. The function a(n) grows to infin- 
ity but its growth is extremely slow. We obtain (J2J and some generalizations 
by reductions to Davenport-Schinzel sequences (DS sequences). We continue 
now with a brief review of results on DS sequences that will be needed in 
the following. We refer the reader for more information and references on 
DS sequences and their applications in computational and combinatorial ge- 
ometry to Agarwal and Sharir [I], Klazar ^7j, Sharir and Agarwal and 
Valtr 

If u = aia2 . . . a r and v = b\bi . . . b s are two finite sequences (words) 
over a fixed infinite alphabet A, where A contains N = {1,2, . . .} and also 
some symbols a,b,c,d, . . ., we say that v contains u and write v y u if v 
has a subsequence b^b^ . . . b ir such that for every p and q we have a p = a q 
if and only if b ip = b iq . In other words, v has a subsequence that differs 
from u only by an injective renaming of the symbols. For example, v = 
ccaaccbaa y 22244 = u because v has the subsequence cccaa. On the other 
hand, ccaaccbaa )f 12121. A sequence u = aid 2 . . . a r is called k-sparse, 
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where k G N, if cij = aj, i < j, always implies j — % > k; this means that 
every interval in u of length at most k consists of distinct terms. The length 
r of u is denoted by \u\. For two integers a < b we write [a, b] for the interval 
{a, a + 1, . . . , b}. For two functions /, g : N — > R the notation / <C g is 
synonymous to the / = 0(g) notation; it means that \f(n)\ < c\g(n) \ for all 
n > n with a constant c > 0. 

The classical theory of DS sequences investigates, for a fixed sGN, the 
function X s (n) that is defined as the maximum length of a 2-sparse sequence 
v over n symbols which does not contain the s + 2-term alternating sequence 
ababa ... (a ^ b). The notation X s (n) and the shift +2 are due to historical 
reasons. The term DS sequences refers to the sequences v not containing 
a fixed alternating sequence. The theory of generalized DS sequences in- 
vestigates, for a fixed sequence u that uses exactly k symbols, the function 
ex(u, n) that is defined as the maximum length of a fc-sparse sequence v such 
that v is over n symbols and v )/- u. Note that ex(u,n) extends X s (n) since 
X s (n) = ex(ababa . . . , n) where ababa . . . has length s + 2. In the definition 
of ex(u, n) one has to require that v is fc-sparse because no condition or even 
only k — 1-sparseness would allow an infinite v with v )f- u; for example, 
v = 12121212 ...)/- abca = u and v is 2-sparse (but not 3-sparse). An easy 
pigeon hole argument shows that always ex(u,n) < oo. 

DS sequences were introduced by Davenport and Schinzel [7] and strongest 
bounds on X s (n) for general s were obtained by Agarwal, Sharir and Shor 
[2]. We need their bound 

X 6 (n) < n ■ 2( 1 +°( 1 » Q (") 2 (3) 

(recall that X§(n) = ex(abababab,n)). Hart and Sharir ^3] proved that 

na(n) <C X 3 (n) <^ na(n). (4) 

In Klazar ^3] we proved that if u is a sequence using k > 2 symbols and 
\u\ — I > 5, then for every jigN 

ex(u, n) < n • A;2^ 3 ■ (ioA;) 2 ^'-^ 8 ^" 5 . (5) 

It is easy to show that for k = 1 or / < 4 we have ex(u, n) -C n. In particular, 
for the sequence 

u(k,l) = 12. . . k!2. . . k. . . 12. . . k (6) 
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with Z segments 12 ... k we have, for every fixed k > 2 and I > 3, 

ex(u(ife, /), n) < n ■ k2 kl ~ 3 ■ (iQ k f a ^ kl ~ A+ ^ kl ~\ (7) 

We denote the factor at n in (JJJ) as /3(k,l,n). Thus 

Z, n) = fc2 fc/ - 3 (10A;) 2a W +8 "W" 5 . (8) 

Let us see now how Q and the lower bound in (j3J) imply (J2J). Let G = 
be any simple graph not containing Go (given in as an ordered 
subgraph. Consider the sequence 

v = hh ...I n 

over [n] where U is the decreasing ordering of the list {j : G E & j < i}. 

Note that h = and |v| = 

Lemma 1.1 If v >- abababab then Gq is an ordered subgraph of G. 

Proof. We assume that v has an 8-term alternating subsequence 

. . . di . . . b\ . . . a,2 ■ ■ ■ hi . . . a-i . . . 63 . . . 04 . . . &4 . . . 

where the appearances of two numbers a 7^ b are indexed for further dis- 
cussion. We distinguish two cases. If a < b then a 2 , b 2 , a^, and 64 lie, 
respectively, in four distinct intervals I p , I q , I r , and I s , p < q < r < s, (since 
every Jj is decreasing) and b < p (since b x precedes a 2 ). Hence G is an 
ordered subgraph of G. If b < a then b\, a 2 , 63, and a 4 lie, respectively, in 
four distinct intervals I p , I q , I r , and I s , p < q < r < s, and a < p. Again, Gq 
is an ordered subgraph of G. □ 

Thus v has no 8-term alternating subsequence. In v immediate repetitions 
may appear only on the transitions Iih+i. Deleting at most n — 1 (actually 
n — 2 because I\ = 0) terms from v we obtain a 2-sparse subsequence w on 
which we can apply @. We have 

\E\ = \v\ <\w\+n-l< A 6 (n) +n- 1 < n ■ 2 (1+o(1))a(n)2 . 

On the other hand, let n G N and v be the longest 2-sparse sequence 
over [n] such that v )/- ababa. It uses all n symbols and, by the lower bound 
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in (HJ), M > cnot{n) for an absolute constant c > 0. Notice that every 
i G [n] appears in v at least twice. We rename the symbols in v so that 
for every 1 < i < j < n the first appearance of j in v precedes that of 
%\ this affects neither the property v )/- ababa nor the 2-sparseness. By an 
extremal term of v we mean the first or the last appearance of a symbol 
in v. The sequence v has exactly 2n extremal terms. We decompose v 
uniquely into intervals v — I± I2 ■ ■ ■ Im so that every Zj ends with an extremal 
term and contains no other extremal term. Every Jj consists of distinct 
terms because a repetition ... 6 ... 6 ... in Jj would force a 5-term alternating 
subsequence ... a ... b ... a ... b ... a . . . in v. We define a simple (bipartite) 
graph G* = ([Sn],E) by 

{i, j} G E <^=^ i G [n] & j G [n + 1, 3n] & i appears in Ij- n . 

G* has 3n vertices and \E\ = \v\ > cna(n) edges. Suppose that G* contains 
the forbidden ordered subgraph Go on the vertices 1 < a\ < a 2 < . . . < 
a 6 < 3n. By the definition of G*, z = a^a^ is a subsequence of v and 
its terms appear in 7 a3 _ n , . . . , I a6 - n , respectively. Since a 2 > cti, number a 2 
must appear in v before z starts and therefore v contains a 5-term alternating 
subsequence but this is forbidden. So G* does not contain G and shows that 

g{n) ^> na(n). 

This concludes the proof of (0). 

Problem 1.2 Narrow the gap A3(n) g(n) ^ Xe{n) in What is the 
precise asymptotics ofg(n)? 

Our illustrative example with g{n) shows that the ordered version of a 
simple graph extremal problem may differ dramatically from the unordered 
one. Classical extremal theory of graphs and hypergraphs deals with un- 
ordered vertex sets and it produced many results of great variety — see, for 
example, Bollobas 0111, Frankl [0], Fiiredi [TT], and Tuza (201 |2T]. However, 
only little attention has been paid to ordered extremal problems. The only 
systematic studies devoted to this topic known to us are Fiiredi and Hajnal 
[T2*l (ordered bipartite graphs) and Brass, Karolyi and Valtr [H] (cyclically or- 
dered graphs). We think that for several reasons ordered extremal problems 
should be studied and investigated more intensively. First, for their intrinsic 
combinatorial beauty. Second, since they present to us new functions not to 
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be met in the classical theory: nearly linear extremal functions, like na(n) 
or nlogn, are characteristic for ordered extremal problems and it seems that 
they cannot appear without ordering of some sort. Third, estimates coming 
from ordered extremal theory were successfuly applied in combinatorial ge- 
ometry, where often the right key to a problem turns out to be some linear or 
partial ordering, and to obtain more such applications we have to understand 
more thoroughly combinatorial cores of these arguments. 

Before summarizing our results, we return to DS sequences and show 
that the sequential containment -< can be naturally interpreted in terms of 
particular hypergraphs, (set) partitions. A sequence u = a\d2 ■ ■ ■ a r over the 
alphabet A may be viewed as a partition P of [r] where i and j are in the 
same block of P if and only if a; = aj. Thus blocks of P correspond to 
the positions of symbols in u. For example, u = abaccba is the partition 
{{1, 3, 7}, {2, 6}, {4, 5}}. If u = ([r], ~ u ) and v = ([s], ~„) are two sequences 
given as partitions by equivalence relations, then u -< v if and only if there is 
an increasing injection / : [r] — > [s] such that x ~ u y <=}■ f(x) ~„ f(y) holds 
for every x, y G [r]. 

In this article we introduce and investigate a hypergraph containment 
that generalizes both the ordered subgraph relation and the sequential con- 
tainment. The containment and its associated extremal functions ex e (F,n) 
and eXi(F,n) are given in Definitions 12.11 and 12.21 The function ex e (F,n) 
counts edges in extremal simple hypergraphs H not containing a fixed hy- 
pergraph F and the function eXj(F, n) counts sums of edge cardinalities. 
In Theorem 12.31 we show that for many F one has eXj(F, n) <C ex e (F,n). 
Theorem 13.11 shows that if F is a simple graph, then in some cases good 
bounds on ex e (F,n) can be obtained from bounds on the ordered graph 
extremal function gex(F,n). We apply Theorem 13.11 to prove in Theo- 
rem O that for d = ({1,3}, {1,5}, {2, 3}, {2, 4}) one has ex e (G 1 ,n) < 
n ■ (logn) 2 • (loglogn) 3 and the same bound for exj(Gi,n); this generalizes 
the bound gex(Gi,n) n • logn of Fiiredi. In another application, Theo- 
rem E31 we prove that the unordered hypergraph extremal function ex" (F, n) 
of every forest F is n . In Theorem 14. II we generalize the bound JSJ) to hy- 
pergraphs. In Theorem 14.21 we prove that if F is a star forest, then ex e (F,n) 
has an almost linear upper bounds in terms of a(n); this generalizes the 
upper bound in (J2J). In the concluding section we introduce the notion of 
orderly bipartite forests and pose some problems. 

This article is a revised version of about one half of the material in the 
technical report ^H]- We present the other half in [18,. 
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2 Definitions and bounding weight by size 



A hypergraph H = (Ei : i G I) is a finite list of finite nonempty subsets Ej of 
N = {1, 2, . . .}, called edges. H is simple if Ei ^ Ej for every i,j & I, i ^ j. 
H is a <?rap/i if |Ej| =2 for every i £ I. H is a partition if Ej fl Ej = for 
every i, j E I , i ^ j. The elements of U if = Uig/ Ej C N are called vertices. 
Note that our hypergraphs have no isolated vertices. The simplification of 
H is the simple hypergraph obtained from H by keeping from each family of 
mutually equal edges just one edge. 

Definition 2.1 Let E~ = (Ej : i e I) and H' = (E[ : i G I') be two 

hypergraphs. H contains H' , in symbols H >- H' , if there exist an increas- 
ing injection F : \J H' — > U H and an injection /:/'—>/ s«c/i £/ia£ i/ie 
implication 

veE[^ F(v) G E /(i) 

ZioZds /or every vertex v E \JH' and every index i G I'. Else we say that H 
is H'-free and write H )/- H' . 

The hypergraph containment -< clearly extends the sequential contain- 
ment and the ordered subgraph relation. We give an alternative definition 
of -<. H — (Ei : % G I) and H' = (E[ : % G I') are isomorphic if there 
are an increasing bijection F : U H' — > U H and a bijection /:/'—>/ such 
that E(E-) = Ej(j) for every % G E. H' is a reduction of # if E C / and 
E 4 ' C Ej for every % G E. Then H' -< H if and only if E 7 is isomorphic to a 
reduction of E^. We call that reduction of H an H'-copy in E\ For example, 
if H' = ({l}i, {1)2) (H' is a singleton edge repeated twice) then H )/- H' iff 
E^ is a partition. Another example: If H' = ({1, 3}, {2, 4}) then E^ is H'-free 
iff E' has no four vertices a < b < c < d such that a and c lie in one edge of 
H and 6 and d lie in another edge. 

The order v(H) oi H = (Ei : % G I) is the number of vertices w(E") = 
ILJE], the sue e(E r ) is the number of edges e(H) = \H\ = |/|, and the 
weight i(H) is the number of incidences between the vertices and the edges 
i(H) = J2 ie i \Ei\. Trivially v(H) < i(H) and e(H) < i(H) for every H. 

Definition 2.2 Let F be any hypergraph. We associate with F the extremal 
functions ex e (E), eXj(E) : N — > N, defined by 

ex e (F,n) = max{e(E") : H )/- F Sz H is simple & v(H) < n} 
exj(E, n) = max{i(H) : H )f F & H is simple & v(H) < n}. 
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We considered ex e (F,n) and exj(F, n) implicitly already in Klazar [To] . 
Except of this article, to our knowledge, this extremal setting is new and was 
not investigated before. Obviously, for every neN and F, ex e (F, n) < 2™ — 1 
and exi(F,n) < n2 n ~ 1 but much better bounds can be usually given. The 
reversal of a hypergraph H = (Ei : i G /) with N = max((J_FT) is the 
hypergraph H = (E^ : i <E I) where Ei = {N — x + 1 : a; G -Ej}. Thus 
reversals are obtained by reverting the linear ordering of vertices. It is clear 
that ex e (F,n) = ex e (F,n) and exi(F,n) = exi(F,n) for every F and n. 

We give a few comments on Definitions 12 . II and 12 . 21 Replacing the impli- 
cation in Definition ^, ll with an equivalence, we obtain an induced hypergraph 
containment that still extends the sequential containment. Let iex e (F,n) be 
the corresponding extremal function. For F^ = ({1}, {2}, . . . , {k}) and k > 2 
we have iex e (F fc ,n) > U™ 2 ) because the hypergraph (E : i? C [n] &; = 
n — k + 2) does not contain Ffc in the induced sense. But in the hypergraph 
"DS theory" such uncomplicated hypergraphs like Fk should have linear (or 
almost linear) extremal functions. Thus the induced containment is not the 
right generalization, as far as one is interested in " DS theories" . 

For graphs, if H 2 y Hi and H 2 is simple then Hi is simple as well. But 
simple hypergraphs may contain hypergraphs that are not simple. In Def- 
inition 12.21 H must be simple because allowing all H would produce usu- 
ally the value +oo (the simplicity of H may be dropped only for F = 
({l}i, {1)2, . . • , {l}fc)). On the other hand, we allow for the forbidden F 
any hypergraph: F need not be simple and may have singleton edges. Such 
a freedom for F seemed to one referee of ^H] "very artificial". This opinion is 
understandable but the author does not share it. Putting aside psychological 
inertia, there is no reason why to restrict F from the outset to be simple or 
in any other way. On the contrary, doing so we might miss some connections 
and phenomena. Thus we start in the definitions with completely arbitrary 
F and restrict it later only if circumstances require so. 

Another perhaps unusual feature of our extremal theory is that in H and 
F edges of all cardinalities are allowed; in extremal theories with forbidden 
substructures it is more common to have edges of just one cardinality. This 
led naturally to the function eXj(F, n) which accounts for edges of all sizes. 
Trivially, eXj(F, n) > ex e (F,n) for every hypergraph F and n G N. On 
the other hand, Theorem 12.31 shows that for many F one has eXj(F, n) 
ex e (F,n). In Definition 12.21 we take all H with v(H) < n that the extremal 
functions be automatically nondecreasing. Replacing v (H) < n with v (H) = 
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n would give more information on the functions but also would bring the 
complication that then extremal functions are not always nondecreasing. It 
happens for F = ({1}, {2}, . . . , {k}) and we analyze this phenomenon in 

Theorem 2.3 Suppose that the hypergraph F has no two separated edges, 
which means that Ei < E 2 holds for no two edges of F . Let p = v(F) and 
q = e(F) > 1. Then for every n G N, 

eXi{F,n) < (2p-l)(q-l)-ex e (F,n). 

Proof. Suppose that H attains the value exj(F, n). We transform H 
in a new hypergraph H' by keeping all edges with less than p vertices 
and replacing every edge E = {vi,v 2 , ■ ■ ■ ,v s } of H with s > p, where 
V\ < t>2 < ••• < v s , by t = [\E\/p\ new p-element edges {v%, . . . ,v p }, 
{v p+ i, . . . , V2 P }, • • • , {v (t-i) p +i, ■ ■ ■ , v tp }. H' may not be simple and we set 
H" to be the simplification of H' . Two observations: (i) no edge of H' re- 
peats q or more times and (ii) H" is F-free. If (i) were false, there would be 
q distinct edges Ei, . . . , E q in H such that | nf=i Ei\ > p. But this implies 
the contradiction F -< H. As for (ii), any F-copy in H" may use from every 
E G H at most one new edge E" C E (the new edges born from E are 
mutually separated) and so it is an F-copy in H as well. The observations 
and the definitions of H' and H" imply 

e M F,n)=i(H) < ^ = 9 " ^ < < 2 " = M Z D : W 

< (2p-l)(g-l).e(iF') 

< (2p-l)(g-l)-ex e (F,n). 

The last innocently looking inequality follows from the fact that ex e (F, n) is 
nondecreasing by definition. □ 

However, eXj(F, n) ex e (F, n) does not hold for F k = ({1}, {2}, . . . , {k}) 
and k > 2: ex e (Ft, n) = 2 /c ~ 1 — 1 for n > k — 1 and eXj(Ffc, n) = (/c — l)n — (A; — 
2) for n > max(fc,2 fc " 2 ) ((THI HHD- Note that for F = ({1}) both extremal 
functions are undefined. F^ is highly symmetric and the ordering of vertices 
is irrelevant for the containment H y F k . 

Problem 2.4 Prove that if F is not isomorphic to ({1}, {2}, . . . , {k}), k > 
1, then exj(F, n) <C ex e (F,n). 
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3 Bounding hypergraphs by means of graphs 

For a family of simple graphs R and nGNwe define 

gex(R, n) = max{e(G) : G )/- G' for all G' G R & G is a simple graph 
& v(G) < n} 

and for one simple graph G we write gex(G, n) instead of gex({G}, n). Fiiredi 
proved in [TU], see also [12], that for 



G 1 = ({1,3}, {1,5}, {2,3}, {2,4})= I I (9) 

one has 

nlogn -C gex(Gi, n) <C nlogn. (10) 

(In PU] and ^2] the investigated objects are 0-1 matrices, which are ordered 
bipartite graphs, but in the case of G\ the bounds are easily extended to 
all ordered graphs.) Attempts to generalize the upper bound in (fTUj) to 
hypergraphs motivated the next theorem. 

For k G N we say that a simple graph G' is a k-blow-up of a simple graph 
G if for every edge coloring x '■ G' — ► N that uses every color at most k times 
there is a G-copy in G' with totally different colors, that is, x is injective on 
the G-copy. For k G N and a simple graph G we write B (k, G) to denote the 
set of all /c-blow-ups of G. 

Theorem 3.1 Let F be a simple graph with p = v(F) and q = e(F) > 1 and 
let B C B(fy,F). If f : N — > N is a nondecreasing function such that 

gex(£>, n) < n • f{n) 

holds for every n G N ; then 

ex e (F, n)<q- gex(F, n) ■ ex e (F, 2f(n) + 1) (11) 
holds for every n G N, n > 3. 

Proof. Suppose that the simple hypergraph H attains the value ex e (F,n) 
and |J H = [m] , m < n. We put in H' every edge of H with more than 1 
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and less than p vertices and for every F G H with |F| >pwe put in H' an 
arbitrary subset E' G E, \E'\ — p. So 2 < \E\ < p for every E G H' and no 
edge of H' repeats more then q — 1 times, for else we would have H y F. 
Let if" be the simplification of H' . We have 

e(if) <n + (g-l)e(if"). 

Let G be the simple graph consisting of all edges E* such that E* G E for 
some E G H" . Observe that if F' G B and F' -< G, then F -< if" and thus 
F -< H. (For the edges E* £ G forming an F'-copy consider the coloring 
x{E*) = E where E G H" is such that E* G E. Every color is used at most 
(2) times and therefore, since F' is a ^) -blow-up of F, we have an F-copy 
in G for which the correspondence F* 1— > F is injective.) Hence F' ~< G for 
no F' G F. Let = n'\ n' < n. We have 

e(G) < gex(F,n') <n' ■ f{n). 

There exists a vertex v E\JG such that 

d = deg G M<2f(n')<2f(n). 

Fix an arbitrary edge Fq, t> G Fq G G. Let X G [n] be the union of all 
edges F G F" satisfying Eq G E and m be the number of such edges in H" . 
We have the inequalities 

m < ex e (F, \X\) and |X| < d + 1. 

Thus 

m < ex e (F, |X|) < ex e (F, d + 1) < ex e (F, 2/(n) + 1). 

We see that the two-element set Fq is contained in at least 1 but at most 
ex e (F, 2f(n) + 1) edges of H" . Deleting those edges we obtain a subhy- 
pergraph H" of H" on which the same argument can be applied. That is, 
a two-element set E{ exists such that E{ G E for at least 1 but at most 
ex e (F, 2f(n) + 1) edges F G H" and clearly E\ ^ Eq. Continuing this way 
until the whole H" is exhausted, we define a mapping 

M :H" -> {F* : F* C [n], |F*| = 2} 

such that 

M(E)GE and |Af _1 (£7*)| < ex e (F, 2/(n) + 1) 
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holds for every E G H" and E* C [n], \E*\ = 2. Let G' be the simple graph 
G' = M(H") and v(G') = n'; n' < n. 

The containment F -< G' implies, by the definition of G', that F -< H" 
and hence F -< H, which is not allowed. Thus 

e{G') < gex(F,n') < gex(F,n). 

Putting it all together, we obtain (since gex(F, n) > n — 1 if q > 1) 

ex e (F, n) = e(H) < n + (q - 1) • e{H") 

< n + (g-l)-ex e (F,2/H + l)-e(G / ) 

< g-ex e (F,2/(n) + l)-gex(F,n) 

for every n > 3. □ 

We give three applications of this theorem. The first one is the promised 
generalization of the upper bound in (JTUJ). 

We say that a simple graph G is a k-multiple of G\, where k G N and 
Gi is defined in ©, if G has this structure: \JG = AU{v}UBUC with 
v4<t><_B<C, | A | = k, the vertex u has degree k and is connected to every 
vertex in A, every vertex in A has degree 2k + l and is besides v connected to 
k vertices in B and to k vertices in C, and G has no other edges. The edges 
incident with v are called backward edges and the edges incident with vertices 
in B U C are called forward edges. We denote the set of all /c-multiples of G\ 
by M(k). 

Lemma 3.2 The sets of graphs M(k), k G N, have the following properties. 

1. For every k, M(3k+1) C B{k 1 G 1 ). In particular, M(31) C B((fj,Gi). 

2. For every k, gex(M(k),n) nlogn. 

Proof. 1. Let G be a 3fc + 1-multiple of Gi and x '■ G — > N be an edge 
coloring using each color at most k times. We select in G two backward edges 
Ei = {i,v} and E 2 = {j, v}, i < j < v, with different colors. It follows that 
we can select in G two forward edges E 3 = {i, 1} and £4 = {j, I'} such that 
v < I' < I and the colors x(-Ei), . . . , x(E^) are distinct. Edges E\, . . . , E4 
form a Gi-copy on which x is injective. Thus G G -B(/c, Gi). 

2. Let n > 2 and G be any simple graph such that \JG = [n] and G )f F 
for every F G M(k). Let Jj = {.E G G : mini? = i} for z G [n]. In every Jj 
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it is possible to mark LI-^IAJ > \Ji\/k — 1 edges so that every marked edge 
is in Jj immediately followed by k — 1 unmarked ones (we order the edges in 
Ji by their endpoints). The graph G' formed by the marked edges satisfies 

e(G') > e(G)/k-n. 

Also, for every edge {i, j} G G', i < j, there are at least k — 1 edges {i, 1} G G 
with / > j, and for every two edges {i, j}, {i, f} G G', i < j < j', there are 
at least k — 1 edges {i, 1} G G with j < I < j'. Now we proceed as in Fiiredi 
[10J. We say that {i,j} G G', i < j, has type (j,m), where m > is an 
integer, if there are two edges {i, 1} and {i, I'} in G' such that j < I < I' and 
I — j < 2 m < I' — j. Consider the partition 

a = g* u g** 

where G* is formed by edges with at least one type and G** by edges without 
type. It follows from the definition of type and of G' that if k edges of G* 
have the same type, then F -< G for some F G M(k) which is forbidden. 
Thus any type is shared by at most k — 1 edges. Since the number of types 
is less than n(l + log 2 n), we have 

e(G*) < (k - l)n + (k - l)n log 2 n. 

To bound e(G**), we fix a vertex i G [n] and consider the endpoints i < jo < 
ji < ■ ■ ■ < j t _i < n of all edges E G G' which have no type and min E = i. 
Let d r = jr—jr-i fo r 1 < r < t— 1 and D — di + - ■ - + d t _j = jt-i—jo- If d\ — 
D/2, then d\ <2 m < D for some integer m > and the edge {i, jo} would 
have type (jo,fn) because of the edges {i,ji} and {i,j t -i}- Thus di > D/2 
and D — di < D/2. By the same argument applied to {i,ji}, d 2 > {D — d\)/2 

and thus D - d 1 - d 2 < D/A. In general, 1 < D — d\ — d r < D/2 r for 

1 < r < t — 2. Thus t < log 2 D + 2 < 2 + log 2 n. Summing these inequalities 
for all z G [n], we have 

e(G**) < 2n + nlog 2 n. 

Alltogether we have 

e(G) < A;n + fc(e(G*) + e(G**)) < (A; 2 + + fc 2 nlog 2 n. 

We conclude that gex(M(/c),n) ralogn and the constant in depends 
quadratically on k. □ 
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Theorem 3.3 Let G\ be the simple graph given in (OJ). We have the follow- 
ing bounds. 

1. n ■ logn <C ex e (Gi, n) -C n ■ (logn) 2 • (log log n) 3 . 

2. n ■ logn <C exj(Gi, n) -C n • (logn) 2 • (log logn) 3 . 

Proof. 1. The lower bound follows from the lower bound in (jlOj) . To 
prove the upper bound, we use Theorem Kill By 2 of Lemma 13. 2\ we have 
gex(M(31),n) <C nlogn. Also, gex^^n) <C nlogn (by the upper bound 
in (fTUj) or by gex(Gi,n) < gex(£>,n)). By 1 of Lemma l3~2l we can apply 
Theorem 13 . 1 1 wit h B = M(31). Starting with the trivial bound ex e (Gi,n) < 
2 n , (|TT|) with f(n) <C logn gives 

ex e (Gi, n) <C n c 

where c > is a constant. Feeding this bound back to (fTTj) . we get 

ex e (Gi,n) <n - (logn) c+1 . 
Two more iterations of (fTTj) give 

ex e (Gi,n) <C n ■ (logn) 2 ■ (loglogn) c+1 

and 

ex e (Gi,n) <C n ■ (logn) 2 ■ (log logn) 2 • (logloglogn) c+1 

which is slightly better than the stated bound. 

2. The lower bound follows from exj(Gi,n) > ex e (Gi,n). The upper 
bound follows from the upper bound in 1 by Theorem 12.31 □ 

Problem 3.4 What is the exact asymptotics of ex e (Gi, n) ? 

The second application of Theorem 13 . 1 1 concerns unordered extremal func- 
tions eXg(F, n) and gex n (G,n). They are defined as ex e (F, n) and gex(G, n) 
except that in the containment the injection need not be increasing. So 
gex n (G,n) is the classical graph extremal function. It is well known, see for 
example Bollobas jSJ Exercise 24 in IV. 7], that gex'"(F, n) < {e(F) — 1) • n 
for every forest F. We extend the linear bound to unordered hypergraphs. 
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Theorem 13.11 holds also in the unordered case because the proof is indepen- 
dent of ordering. Ordering is crucial only for obtaining linear or almost linear 
bounds on gex(F, n) and gex(£?,n) because the inequality (fTTj) is useless if 
f(n) is not almost constant. The proof of Theorem 13.11 shows also that if 
F is a forest and all members of B are forests (which is not the case for 
B = M(k)) then (j) can be replaced by p — 1 (because for \E\ = p every p 
two-element edges E* C E contain a cycle but no F' G B has a cycle). 

Theorem 3.5 Let F be a forest. Its unordered hypergraph extremal function 
satisfies 

ex"(F,n) < n. 

Proof. Let v(F) = p and e(F) = q > 1 (case q = 1 is trivial). It is not 
hard to find a forest F' with large e(F') = Q — Q < (pq 2 ) q+1 suffices - 
that is a p — 1-blow-up of F. We set B = {F'} and use (fTTj) with the bounds 
gex n (F, n) < (q — l)n, f(n) = Q — 1 (since gex n (B,n) = gex u (F',n) < 
(Q - l)n), and ex^(F,n) < 2 n (trivial): 

ex^(F, n)<q-(q-l)n- 2 2Q ~ 1 = Q4 Q • n. 

□ 

One can prove the bound ex™(F, n) n also directly, without Theorem 13.11 
by adapting the proof of gex u (F, n) < n to hypergraphs. The third applica- 
tion of Theorem 13.11 follows in the next section. 

4 Partitions and star forests 

The bound (0) tells us that if F is any fixed partition with k blocks and 
if is a /c-sparse partition with H )/- F, then v(H) = i(H) has an almost 
linear upper bound in terms of e(H). The following theorem bounds i(H) 
almost linearly in terms of e(H) in the wider class of (not necessarily simple) 
hypergraphs H. The proof is based on (f7j). 

Theorem 4.1 Let F be a partition with p = v(F) and q = e(F) > 1 and H 
be a F-free hypergraph, not necessarily simple. Then 

i{H) < (q - 1) ■ v(H) + e{H) ■ /3(q, 2p, e(H)) (12) 

where f3(k,l,n) is the almost constant function defined in (0). 
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Proof. Let \JH — [n] and the edges of H be E\, E 2 , . . . , E e where e = e(H). 
We set, for 1 < i < n, Si = {j G [e] : i G Ej} and consider the sequence 

v = hl 2 ...I n 

where Jj is an arbitrary ordering of Si. Clearly, v is over [e] and \v \ = i{H). 
The sequence v may not be g-sparse, because of the transitions IJi+i, but it 
is easy to delete at most q — 1 terms from the beginning of every Jj, % > 1, so 
that the resulting subsequence w is g-sparse. Thus |iu| > |u| — (q— l)(n— 1). 
It follows that if w contains u(q,2p), where u(k,l) is defined in JBJ, then H 
contains F but this is forbidden. (Note that the subsequence aab in v forces 
the first a and the b to appear in two distinct segments U and thus it gives 
incidences of E a and Ef, with two distinct vertices.) Hence w )f u(q,2p) and 
we can apply (J7J): 

i(H) = \v\ < (q — l)n +\w\ < (q — l)n + e ■ f3(q, 2p, e). 

□ 

We show that for the partition 



F = if 2 = ({l,3,5},{2,4}) = 



the factor at e(H) in ([T2|l must be 3> a(e(H)). We proceed as in the proof 
of g(n) = gex(Go,n) 3> na(n) in (j2J) and take a 2-sparse sequence f over [n] 
such that f )f 12121, |u| ^> na(n), and v = I\ I 2 ■ ■ ■ I% n where every interval 
Ii consists of distinct terms. We define the hypergraph 

H = (Ei : i £ [n]) with Ei = {j G [2n] : i appears in Ij}. 

We have i(H) = \v\ 3> na(n), \JH = [2n], v(H) = 2n, and e(H) = n. It is 
clear that H )/- H2 because v )f 12121. 

Taking in Theorem 14. II H to be simple and with the maximum weight, 
we obtain as a corollary that if F is a partition, p = v(F), and q = e(F) > 1, 
then 

eXj(F, n) < (q - l)n + ex e (F, n) ■ (3(q, 2p, ex e (F, n)). 
But Theorem 12.31 when it applies, gives better bounds. 
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Our last theorem generalizes in two ways the upper bound in (J2|i. We 
consider a class of forbidden forests that contains Go as a member and we 
extend by means of Theorems 13.11 and 12.31 the almost linear upper bound to 
hypergraphs. The class consists of star forests which are forests F with this 
structure: U F = A U B for some sets A < B such that every vertex in B has 
degree 1 and every edge of F connects A and B. Thus F is a star forest iff 
every component of F is a star and every central vertex of a star is smaller 
than every leaf. 

Theorem 4.2 Let F be a star forest with r > 1 components, p vertices, and 
q = p — r edges. Let t — (p — l)(q — 1) + 1 and(3(k,l,n) be the almost constant 
function defined in (Q|). We have the following bounds. 

1. gex(F, n) < (r — l)n + n ■ /3(r, 2q, n). 

2. ex e (F, n) < n ■ (3(r, 2tq, n) 3 . 

3. eXi(F, n) < n • @(r, 2tq, n) 3 . 

Proof. 1. We mark the centers of the stars in F with 1,2, ... , r according to 
their order and give the leaves of any star the mark of its center. The marks 
on all leaves then form a sequence u over [r] of length p — r. Now let G be 
any simple graph with (J G = [n] and G )f F . We consider the sequence 

v = hl 2 ...I n 

where Ij is any ordering of the set {i G [n] : {i,j} G G, i < j}. As in 
the previous proof, we select an r-sparse subsequence w of v with length 
M > \v\ — (r — l)(n — 1). Suppose that w y u(r, 2{p — r)) where u(k, I) is 
defined in ©. Then w has a (not necessarily consecutive) subsequence z of 
the form 

Q,\Cb2 ■ ■ ■ CL r CLiQi2 • • ■ &r • • • 0\CL2 ■ ■ ■ Q>r 

with 2(p — r) segments a^a2 ■ ■ ■ a r - We have < a i2 < . . . < ai r for a 
permutation z 1( i 2 , . . . , i r of [r]. We label every term a^. in z with j. Clearly, 
if we select one term from the 2-nd, 4-th, . . ., 2(p — r)-th segment of z 
so that the labels on the selected terms coincide with the sequence u, the 
selected terms lie in p — r distinct intervals Ij 1 , . . . , Ij p _ r , j\ < ... < j p - r . 
Since the selected terms are preceded by one segment aia 2 ■ ■ ■ a r , we have 
a ir < j\. The edges between a^, . . . , a r and ji, . . . ,j p - r which corespond to 
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the selected terms form an F-copy in G but G y F is forbidden. Therefore 
w )f- u(r, 2(p — r)) and we can apply (J7J): 

e(G) = \v\ < (r — l)n + |u>| < (r — l)n + n • /3(r, 2(p — r),n). 

2. Suppose that F has the vertex set [p] (so that [r] are the centers of 
the stars and [r + l,p] are the leaves). For k e N we denote -F(ft) the star 
forest with the vertex set [r + (p — r)k] in which [r] are again the centers of 
stars and for i = 1, 2, . . . , p — r the vertices in [r + (i — 1)A; + 1, r + ifc] are 
joined to the same vertex in [r] as r + % is joined in F. It is easy to see that 
F(t) = F((p-l)(g-l)+l) isap-l-blow-upofF. Also, e(F(k)) = kq. We set 
-B = {-F(t)} and use (|TT|) with the bounds gex(F, n) <C n ■ /3(r, 2q, n) = n- f3' 
(bound 1 for F), f(n) = c/3(r, 2tq, n) = c/3 for a constant c > (bound 1 for 
F(t)), and ex e (F,n) < 2 n (trivial): 

ex e (F, n) < n ■ ■ 2 W < n ■ 2 2 ( c+1 ^. 

The second application of (fTTj) gives 

ex e (F, n) < n ■ (5' ■ f3 ■ 2 2 ( c + 1 )-' 3 (^2c/m) « n . ^3 

because f3' < (3 and 

/3 (r, 2tg, x) -C log log x 

(this is true with any number of logarithms). 

3. This follows from 2 by Theorem 12.31 □ 

The lower bound in (j2J) shows that in general the factor at n in 1, 2, and 3 
of the previous theorem cannot be replaced with a constant and may be as 
big as ^> a(n). The proved bounds hold also for the reversals of star forests. 

5 Concluding remarks 

It is reasonable to call a function / : N — > R nearly linear if n l ~ e f(n) 
n i+e hoidg f or every e > 0. We identify a candidate for the class of all hyper- 
graphs F with nearly linear ex e (F,n). If F is isomorphic to the hypergraph 
({1}, {2}, . . . , {k}), then ex e (F, n) is eventually constant ([IE]) and thus is 
not nearly linear. For other hypergraphs we have ex e (F,n) > n because 
F -/{ ({!}, {2}, . . . , {n}). An orderly bipartite forest is a simple graph F such 
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that F has no cycle and min E < max E' holds for every two edges of F. In 
other words, F is a forest and there is a partition [j F = A U B such that 
A < B and every edge of F connects A and B. We denote the class of all 
orderly bipartite forests by OBF. We say that F is an orderly bipartite forest 
with singletons if F = F\ U F2 where F\ G OBF and F2 is a hypergraph 
consisting of possibly repeating singleton edges. For example, F may be 

F = ({8}, {6} 1; {6} 2 , {2}, {1,6}, {3, 6}, {4, 5}, {4, 7}). 

The class OBF subsumes star forests and their reversals. G\ defined in Q 
belongs to OBF but is neither a star forest nor a reversed star forest. 

Lemma 5.1 If the hypergraph F is not an orderly bipartite forest with sin- 
gletons, then there is a constant 7 > 1 such that 

ex e (F, n) 3> n 1 

and hence ex e (F,n) is not nearly linear. 

Proof. If F is not an orderly bipartite forest with singletons, then F has (i) 
an edge with more than two elements or (ii) two separated two-element edges 
or (iii) a two-path isomorphic to ({1, 2}, {2, 3}) or (iv) a repeated two-element 
edge or (v) an even cycle of two-element edges (odd cycles are subsumed in 
(iii)). In the cases (i)-(iv) we have ex e (F,n) 3> n 2 because the complete 
bipartite graph with parts [|_ n /2_|] and [[^/2J + l,n] does not contain F. 
As for the case (v), an application of the probabilistic method (Erdos jH]) 
provides an unordered graph that has n vertices, ^> n 1+1 ^ k edges, and no 
even cycle of length k. Thus, in the case (v), ex e (F,n) ^> n 1+1 ^ k for some 
k e N. □ 

We conjecture that ex e (F,n) is nearly linear if and only if F is an orderly 
bipartite forest with singletons that is not isomorphic to ({1}, {2}, ...,{&}). 
Since every orderly bipartite forest with singletons is contained in some or- 
derly bipartite forest, it suffices to consider only orderly bipartite forests. 

Problem 5.2 Prove (or disprove) that for every orderly bipartite forest F 
we have 

ex e (F, n) <C n(logn) c 

for some constant c > 0. 
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It is not difficult to find for every F E OBF and k E N an F' E OBF that 
is a fc-blow-up of F. Thus the previous bound would follow by Theorem 13.11 
from the graph bound gex(F,n) <C n(logn) c . 

It is natural to consider two subclasses OBF' C OBF Q C OBF where 
OBF' consists of all F E OBF with ex e (F, n) < n and OBF" consists of all 
F E OBF with ex. e (F, n) <C n-f(a(n)) for a primitive recursive function f(n). 
Both inclusions are strict as witnessed by G and G\ (defined in (JTJ) and (J2J)). 
In this article we ignored the class OBF z completely and showed that OBF a 
contains all star forests (and their reversals). It would much interesting to 
learn more about OBF' and OBF a . Does the latter class consist only of star 
forests and their reversals? 
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